Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 4416 |
| Missing cells | 2714 |
| Missing cells (%) | 4.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 483.1 KiB |
| Average record size in memory | 112.0 B |
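The overview figures can be cross-checked against each other. The sketch below is a sanity check, not part of the report. It assumes every column is stored as an 8-byte value (an assumption, though consistent with the 112.0 B average record size) and takes the per-column missing counts from the "Missing" alerts listed further down.

```python
# Figures copied from the report; the 8-byte column width is an assumption.
n_rows, n_cols = 4416, 14
missing_per_column = {
    "lag_168": 168, "lag_336": 336, "lag_504": 504, "lag_672": 672,
    "rolling_mean_336h": 337, "rolling_mean_168h": 169,
    "trend_lag168": 180, "seasonal_lag168": 168, "resid_lag168": 180,
}

missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (n_rows * n_cols)
record_bytes = n_cols * 8                  # assumed 8 bytes per value
total_kib = record_bytes * n_rows / 1024   # excludes any index overhead

print(missing_cells)           # 2714
print(round(missing_pct, 1))   # 4.4
print(record_bytes)            # 112
print(round(total_kib, 1))     # 483.0
```

The small gap between 483.0 KiB and the reported 483.1 KiB is presumably index overhead, which this sketch does not model.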
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 13 |
| num_orders is highly correlated with month and 8 other fields | High correlation |
|---|---|
| month is highly correlated with num_orders and 3 other fields | High correlation |
| lag_168 is highly correlated with num_orders and 7 other fields | High correlation |
| lag_336 is highly correlated with num_orders and 6 other fields | High correlation |
| lag_504 is highly correlated with num_orders and 6 other fields | High correlation |
| lag_672 is highly correlated with num_orders and 6 other fields | High correlation |
| rolling_mean_336h is highly correlated with num_orders and 8 other fields | High correlation |
| rolling_mean_168h is highly correlated with num_orders and 8 other fields | High correlation |
| trend_lag168 is highly correlated with num_orders and 4 other fields | High correlation |
| seasonal_lag168 is highly correlated with num_orders and 6 other fields | High correlation |
| num_orders is highly correlated with month and 8 other fields | High correlation |
| month is highly correlated with num_orders and 3 other fields | High correlation |
| lag_168 is highly correlated with num_orders and 8 other fields | High correlation |
| lag_336 is highly correlated with num_orders and 6 other fields | High correlation |
| lag_504 is highly correlated with num_orders and 6 other fields | High correlation |
| lag_672 is highly correlated with num_orders and 6 other fields | High correlation |
| rolling_mean_336h is highly correlated with num_orders and 8 other fields | High correlation |
| rolling_mean_168h is highly correlated with num_orders and 8 other fields | High correlation |
| trend_lag168 is highly correlated with num_orders and 4 other fields | High correlation |
| seasonal_lag168 is highly correlated with num_orders and 6 other fields | High correlation |
| resid_lag168 is highly correlated with lag_168 | High correlation |
| num_orders is highly correlated with lag_168 and 5 other fields | High correlation |
| month is highly correlated with trend_lag168 | High correlation |
| lag_168 is highly correlated with num_orders and 5 other fields | High correlation |
| lag_336 is highly correlated with num_orders and 5 other fields | High correlation |
| lag_504 is highly correlated with num_orders and 5 other fields | High correlation |
| lag_672 is highly correlated with num_orders and 5 other fields | High correlation |
| rolling_mean_336h is highly correlated with num_orders and 5 other fields | High correlation |
| rolling_mean_168h is highly correlated with num_orders and 5 other fields | High correlation |
| trend_lag168 is highly correlated with month | High correlation |
| num_orders is highly correlated with hour and 9 other fields | High correlation |
| month is highly correlated with trend_lag168 | High correlation |
| hour is highly correlated with num_orders and 4 other fields | High correlation |
| lag_168 is highly correlated with num_orders and 7 other fields | High correlation |
| lag_336 is highly correlated with num_orders and 7 other fields | High correlation |
| lag_504 is highly correlated with num_orders and 8 other fields | High correlation |
| lag_672 is highly correlated with num_orders and 7 other fields | High correlation |
| rolling_mean_336h is highly correlated with num_orders and 8 other fields | High correlation |
| rolling_mean_168h is highly correlated with num_orders and 8 other fields | High correlation |
| trend_lag168 is highly correlated with num_orders and 3 other fields | High correlation |
| seasonal_lag168 is highly correlated with num_orders and 7 other fields | High correlation |
| resid_lag168 is highly correlated with num_orders and 4 other fields | High correlation |
| lag_168 has 168 (3.8%) missing values | Missing |
| lag_336 has 336 (7.6%) missing values | Missing |
| lag_504 has 504 (11.4%) missing values | Missing |
| lag_672 has 672 (15.2%) missing values | Missing |
| rolling_mean_336h has 337 (7.6%) missing values | Missing |
| rolling_mean_168h has 169 (3.8%) missing values | Missing |
| trend_lag168 has 180 (4.1%) missing values | Missing |
| seasonal_lag168 has 168 (3.8%) missing values | Missing |
| resid_lag168 has 180 (4.1%) missing values | Missing |
| datetime has unique values | Unique |
| dayofweek has 624 (14.1%) zeros | Zeros |
| hour has 184 (4.2%) zeros | Zeros |
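The missing-value counts for the lag columns follow from how lagged features are built: shifting an hourly series by k steps leaves the first k rows empty. The sketch below illustrates this in plain Python; it is not the pipeline that produced this dataset, and the helper names `lag` and `rolling_mean` are hypothetical. The two-row rolling window is an assumption inferred from the last rows of the sample, where consecutive lag_168 values 242.0 and 173.0 average to the reported rolling_mean_168h of 207.5.

```python
def lag(series, k):
    """Shift a list forward by k steps, padding the start with None."""
    return [None] * k + series[: len(series) - k]

def rolling_mean(series, window):
    """Trailing mean over `window` values; None until the window is full."""
    out = []
    for i in range(len(series)):
        chunk = series[max(0, i - window + 1): i + 1]
        if len(chunk) < window or any(v is None for v in chunk):
            out.append(None)
        else:
            out.append(sum(chunk) / window)
    return out

n = 4416                              # observations in the report
orders = list(range(n))               # placeholder values, not the real data

lag_168 = lag(orders, 168)
roll_168 = rolling_mean(lag_168, 2)   # assumed 2-row window over the lag

missing_lag = sum(v is None for v in lag_168)
missing_roll = sum(v is None for v in roll_168)
print(missing_lag, round(100 * missing_lag / n, 1))    # 168 3.8
print(missing_roll, round(100 * missing_roll / n, 1))  # 169 3.8
print(rolling_mean([242.0, 173.0], 2)[-1])             # 207.5
```

Under these assumptions the rolling mean loses one extra row on top of the lag, which matches the 169 and 337 missing values reported for the two rolling-mean columns.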
Reproduction
| Analysis started | 2022-03-22 11:04:53.727129 |
|---|---|
| Analysis finished | 2022-03-22 11:05:06.946933 |
| Duration | 13.22 seconds |
| Software version | pandas-profiling v3.1.0 |
datetime
| Distinct | 4416 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.6 KiB |
| Minimum | 2018-03-01 00:00:00 |
|---|---|
| Maximum | 2018-08-31 23:00:00 |
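The calendar features in this dataset (month, dayofweek, hour) are consistent with being derived directly from the datetime column at hourly frequency. The following sketch uses only the standard library to rebuild the hourly index between the reported minimum and maximum and to count a few values. The Monday-is-0 convention for dayofweek is an assumption; it matches the first row of the sample, where 2018-03-01 (a Thursday) has dayofweek 3.

```python
from collections import Counter
from datetime import datetime, timedelta

start = datetime(2018, 3, 1, 0)     # reported minimum
end = datetime(2018, 8, 31, 23)     # reported maximum

# Rebuild the hourly index, end point included.
stamps = []
t = start
while t <= end:
    stamps.append(t)
    t += timedelta(hours=1)

month = Counter(s.month for s in stamps)
dayofweek = Counter(s.weekday() for s in stamps)   # Monday = 0 (assumed)
hour = Counter(s.hour for s in stamps)

print(len(stamps))     # 4416
print(month[4])        # 720
print(dayofweek[0])    # 624
print(hour[0])         # 184
```

These counts agree with the number of observations and with the "Zeros" alerts for dayofweek and hour.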
num_orders
| Distinct | 251 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.4227808 |
| Minimum | 0 |
|---|---|
| Maximum | 462 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 54 |
| median | 78 |
| Q3 | 107 |
| 95-th percentile | 166 |
| Maximum | 462 |
| Range | 462 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 45.02385342 |
|---|---|
| Coefficient of variation (CV) | 0.5333140296 |
| Kurtosis | 3.76808057 |
| Mean | 84.4227808 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 1.188955573 |
| Sum | 372811 |
| Variance | 2027.147377 |
| Monotonicity | Not monotonic |
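Several of the statistics above are simple functions of one another, which gives a quick way to check a report for transcription errors. The sketch below recomputes them from the reported values. It is a consistency check only, not a description of how pandas-profiling computes them internally.

```python
import math

# Values copied from the tables above.
n = 4416
mean, std = 84.4227808, 45.02385342
q1, q3 = 54, 107
minimum, maximum = 0, 462

variance = std ** 2               # reported: 2027.147377
cv = std / mean                   # reported: 0.5333140296
total = mean * n                  # reported sum: 372811
iqr = q3 - q1                     # reported: 53
value_range = maximum - minimum   # reported: 462

print(math.isclose(variance, 2027.147377, rel_tol=1e-6))  # True
print(math.isclose(cv, 0.5333140296, rel_tol=1e-6))       # True
print(round(total), iqr, value_range)                     # 372811 53 462
```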
| Value | Count | Frequency (%) |
| 73 | 65 | 1.5% |
| 57 | 58 | 1.3% |
| 66 | 58 | 1.3% |
| 78 | 54 | 1.2% |
| 84 | 52 | 1.2% |
| 77 | 51 | 1.2% |
| 83 | 50 | 1.1% |
| 80 | 49 | 1.1% |
| 69 | 48 | 1.1% |
| 61 | 48 | 1.1% |
| Other values (241) | 3883 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 3 | 0.1% |
| 2 | 6 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 10 | |
| 6 | 13 | |
| 7 | 15 | |
| 8 | 5 | 0.1% |
| 9 | 8 |
| Value | Count | Frequency (%) |
| 462 | 1 | < 0.1% |
| 437 | 1 | < 0.1% |
| 408 | 1 | < 0.1% |
| 342 | 1 | < 0.1% |
| 295 | 1 | < 0.1% |
| 281 | 2 | |
| 276 | 1 | < 0.1% |
| 273 | 3 | |
| 272 | 1 | < 0.1% |
| 268 | 1 | < 0.1% |
month
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.505434783 |
| Minimum | 3 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 5.5 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.713306101 |
|---|---|
| Coefficient of variation (CV) | 0.3112026876 |
| Kurtosis | -1.274560483 |
| Mean | 5.505434783 |
| Median Absolute Deviation (MAD) | 1.5 |
| Skewness | -0.006006214519 |
| Sum | 24312 |
| Variance | 2.935417795 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 3 | 744 | |
| 5 | 744 | |
| 7 | 744 | |
| 8 | 744 | |
| 4 | 720 | |
| 6 | 720 |
| Value | Count | Frequency (%) |
| 3 | 744 | |
| 4 | 720 | |
| 5 | 744 | |
| 6 | 720 | |
| 7 | 744 | |
| 8 | 744 |
| Value | Count | Frequency (%) |
| 8 | 744 | |
| 7 | 744 | |
| 6 | 720 | |
| 5 | 744 | |
| 4 | 720 | |
| 3 | 744 |
dayofweek
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.005434783 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 624 |
| Zeros (%) | 14.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.990684396 |
|---|---|
| Coefficient of variation (CV) | 0.6623615349 |
| Kurtosis | -1.235249788 |
| Mean | 3.005434783 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.007504656223 |
| Sum | 13272 |
| Variance | 3.962824364 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 648 | |
| 4 | 648 | |
| 5 | 624 | |
| 6 | 624 | |
| 0 | 624 | |
| 1 | 624 | |
| 2 | 624 |
| Value | Count | Frequency (%) |
| 0 | 624 | |
| 1 | 624 | |
| 2 | 624 | |
| 3 | 648 | |
| 4 | 648 | |
| 5 | 624 | |
| 6 | 624 |
| Value | Count | Frequency (%) |
| 6 | 624 | |
| 5 | 624 | |
| 4 | 648 | |
| 3 | 648 | |
| 2 | 624 | |
| 1 | 624 | |
| 0 | 624 |
hour
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.5 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 184 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5.75 |
| median | 11.5 |
| Q3 | 17.25 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11.5 |
Descriptive statistics
| Standard deviation | 6.922970448 |
|---|---|
| Coefficient of variation (CV) | 0.6019974302 |
| Kurtosis | -1.20417852 |
| Mean | 11.5 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0 |
| Sum | 50784 |
| Variance | 47.92751982 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 184 | 4.2% |
| 1 | 184 | 4.2% |
| 22 | 184 | 4.2% |
| 21 | 184 | 4.2% |
| 20 | 184 | 4.2% |
| 19 | 184 | 4.2% |
| 18 | 184 | 4.2% |
| 17 | 184 | 4.2% |
| 16 | 184 | 4.2% |
| 15 | 184 | 4.2% |
| Other values (14) | 2576 |
| Value | Count | Frequency (%) |
| 0 | 184 | |
| 1 | 184 | |
| 2 | 184 | |
| 3 | 184 | |
| 4 | 184 | |
| 5 | 184 | |
| 6 | 184 | |
| 7 | 184 | |
| 8 | 184 | |
| 9 | 184 |
| Value | Count | Frequency (%) |
| 23 | 184 | |
| 22 | 184 | |
| 21 | 184 | |
| 20 | 184 | |
| 19 | 184 | |
| 18 | 184 | |
| 17 | 184 | |
| 16 | 184 | |
| 15 | 184 | |
| 14 | 184 |
lag_168
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 234 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 168 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.6584275 |
| Minimum | 0 |
|---|---|
| Maximum | 462 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 53 |
| median | 77 |
| Q3 | 104 |
| 95-th percentile | 157 |
| Maximum | 462 |
| Range | 462 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 41.84639908 |
|---|---|
| Coefficient of variation (CV) | 0.5124565873 |
| Kurtosis | 3.717051696 |
| Mean | 81.6584275 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 1.059515717 |
| Sum | 346885 |
| Variance | 1751.121116 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 65 | 1.5% |
| 57 | 58 | 1.3% |
| 66 | 58 | 1.3% |
| 78 | 53 | 1.2% |
| 84 | 52 | 1.2% |
| 77 | 51 | 1.2% |
| 83 | 50 | 1.1% |
| 80 | 49 | 1.1% |
| 69 | 48 | 1.1% |
| 61 | 48 | 1.1% |
| Other values (224) | 3716 | |
| (Missing) | 168 | 3.8% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 3 | 0.1% |
| 2 | 6 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 10 | |
| 6 | 13 | |
| 7 | 15 | |
| 8 | 5 | 0.1% |
| 9 | 8 |
| Value | Count | Frequency (%) |
| 462 | 1 | |
| 437 | 1 | |
| 281 | 1 | |
| 273 | 2 | |
| 272 | 1 | |
| 254 | 1 | |
| 253 | 1 | |
| 251 | 1 | |
| 249 | 1 | |
| 248 | 1 |
lag_336
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 223 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 336 |
| Missing (%) | 7.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.53848039 |
| Minimum | 0 |
|---|---|
| Maximum | 437 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 52.75 |
| median | 76 |
| Q3 | 102 |
| 95-th percentile | 152 |
| Maximum | 437 |
| Range | 437 |
| Interquartile range (IQR) | 49.25 |
Descriptive statistics
| Standard deviation | 39.61629508 |
|---|---|
| Coefficient of variation (CV) | 0.4980770928 |
| Kurtosis | 2.408662678 |
| Mean | 79.53848039 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.8510084143 |
| Sum | 324517 |
| Variance | 1569.450836 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 63 | 1.4% |
| 66 | 58 | 1.3% |
| 57 | 58 | 1.3% |
| 78 | 53 | 1.2% |
| 84 | 52 | 1.2% |
| 83 | 50 | 1.1% |
| 77 | 50 | 1.1% |
| 61 | 48 | 1.1% |
| 80 | 48 | 1.1% |
| 62 | 47 | 1.1% |
| Other values (213) | 3553 | |
| (Missing) | 336 | 7.6% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 3 | 0.1% |
| 2 | 6 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 10 | |
| 6 | 13 | |
| 7 | 15 | |
| 8 | 5 | 0.1% |
| 9 | 8 |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 273 | 1 | |
| 253 | 1 | |
| 251 | 1 | |
| 249 | 1 | |
| 248 | 1 | |
| 245 | 1 | |
| 234 | 1 | |
| 231 | 1 | |
| 230 | 1 |
lag_504
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 213 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 504 |
| Missing (%) | 11.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.59458078 |
| Minimum | 0 |
|---|---|
| Maximum | 253 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 52 |
| median | 74 |
| Q3 | 100 |
| 95-th percentile | 145.45 |
| Maximum | 253 |
| Range | 253 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 37.7079744 |
|---|---|
| Coefficient of variation (CV) | 0.4859614425 |
| Kurtosis | 0.8456628968 |
| Mean | 77.59458078 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 0.6377937443 |
| Sum | 303550 |
| Variance | 1421.891333 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 62 | 1.4% |
| 57 | 58 | 1.3% |
| 66 | 57 | 1.3% |
| 78 | 51 | 1.2% |
| 77 | 50 | 1.1% |
| 83 | 49 | 1.1% |
| 84 | 49 | 1.1% |
| 61 | 48 | 1.1% |
| 80 | 48 | 1.1% |
| 74 | 46 | 1.0% |
| Other values (203) | 3394 | |
| (Missing) | 504 | 11.4% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 3 | 0.1% |
| 2 | 6 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 10 | |
| 6 | 13 | |
| 7 | 15 | |
| 8 | 5 | 0.1% |
| 9 | 8 |
| Value | Count | Frequency (%) |
| 253 | 1 | |
| 251 | 1 | |
| 248 | 1 | |
| 245 | 1 | |
| 234 | 1 | |
| 230 | 1 | |
| 229 | 1 | |
| 224 | 1 | |
| 223 | 2 | |
| 222 | 1 |
lag_672
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 206 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 672 |
| Missing (%) | 15.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76.22622863 |
| Minimum | 0 |
|---|---|
| Maximum | 253 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 51 |
| median | 73 |
| Q3 | 98 |
| 95-th percentile | 143 |
| Maximum | 253 |
| Range | 253 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 36.87755256 |
|---|---|
| Coefficient of variation (CV) | 0.4837908581 |
| Kurtosis | 0.7875990059 |
| Mean | 76.22622863 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.6105734582 |
| Sum | 285391 |
| Variance | 1359.953883 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 62 | 1.4% |
| 57 | 58 | 1.3% |
| 66 | 57 | 1.3% |
| 78 | 49 | 1.1% |
| 77 | 49 | 1.1% |
| 84 | 47 | 1.1% |
| 61 | 47 | 1.1% |
| 83 | 46 | 1.0% |
| 48 | 46 | 1.0% |
| 74 | 45 | 1.0% |
| Other values (196) | 3238 | |
| (Missing) | 672 | 15.2% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 3 | 0.1% |
| 2 | 6 | 0.1% |
| 3 | 5 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 10 | |
| 6 | 13 | |
| 7 | 15 | |
| 8 | 5 | 0.1% |
| 9 | 8 |
| Value | Count | Frequency (%) |
| 253 | 1 | |
| 251 | 1 | |
| 245 | 1 | |
| 234 | 1 | |
| 229 | 1 | |
| 224 | 1 | |
| 223 | 1 | |
| 222 | 1 | |
| 216 | 1 | |
| 215 | 1 |
rolling_mean_336h
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 370 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 337 |
| Missing (%) | 7.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.52378034 |
| Minimum | 2.5 |
|---|---|
| Maximum | 355 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 2.5 |
|---|---|
| 5-th percentile | 24.5 |
| Q1 | 56 |
| median | 76.5 |
| Q3 | 100 |
| 95-th percentile | 141.5 |
| Maximum | 355 |
| Range | 352.5 |
| Interquartile range (IQR) | 44 |
Descriptive statistics
| Standard deviation | 35.12166444 |
|---|---|
| Coefficient of variation (CV) | 0.4416498346 |
| Kurtosis | 1.570175869 |
| Mean | 79.52378034 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 0.6474311148 |
| Sum | 324377.5 |
| Variance | 1233.531313 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 69.5 | 34 | 0.8% |
| 75 | 33 | 0.7% |
| 79 | 32 | 0.7% |
| 60.5 | 31 | 0.7% |
| 62.5 | 30 | 0.7% |
| 65.5 | 30 | 0.7% |
| 55.5 | 30 | 0.7% |
| 72.5 | 30 | 0.7% |
| 77 | 30 | 0.7% |
| 78 | 30 | 0.7% |
| Other values (360) | 3769 | |
| (Missing) | 337 | 7.6% |
| Value | Count | Frequency (%) |
| 2.5 | 3 | |
| 3.5 | 1 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 5.5 | 3 | |
| 6 | 3 | |
| 7 | 2 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 8.5 | 7 |
| Value | Count | Frequency (%) |
| 355 | 1 | |
| 302.5 | 1 | |
| 237.5 | 1 | |
| 216.5 | 1 | |
| 215.5 | 1 | |
| 211.5 | 1 | |
| 209 | 1 | |
| 205.5 | 1 | |
| 200.5 | 1 | |
| 200 | 1 |
rolling_mean_168h
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 390 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 169 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.64268896 |
| Minimum | 2.5 |
|---|---|
| Maximum | 367 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 2.5 |
|---|---|
| 5-th percentile | 25.15 |
| Q1 | 56.5 |
| median | 77.5 |
| Q3 | 102 |
| 95-th percentile | 147.85 |
| Maximum | 367 |
| Range | 364.5 |
| Interquartile range (IQR) | 45.5 |
Descriptive statistics
| Standard deviation | 37.3685781 |
|---|---|
| Coefficient of variation (CV) | 0.4577088111 |
| Kurtosis | 2.723833115 |
| Mean | 81.64268896 |
| Median Absolute Deviation (MAD) | 22.5 |
| Skewness | 0.886100439 |
| Sum | 346736.5 |
| Variance | 1396.410629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 69.5 | 34 | 0.8% |
| 75 | 33 | 0.7% |
| 79 | 32 | 0.7% |
| 78 | 31 | 0.7% |
| 60.5 | 31 | 0.7% |
| 54.5 | 30 | 0.7% |
| 77 | 30 | 0.7% |
| 55.5 | 30 | 0.7% |
| 62.5 | 30 | 0.7% |
| 72.5 | 30 | 0.7% |
| Other values (380) | 3936 | |
| (Missing) | 169 | 3.8% |
| Value | Count | Frequency (%) |
| 2.5 | 3 | |
| 3.5 | 1 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 5.5 | 3 | |
| 6 | 3 | |
| 7 | 2 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 8.5 | 7 |
| Value | Count | Frequency (%) |
| 367 | 1 | |
| 355 | 1 | |
| 343 | 1 | |
| 302.5 | 1 | |
| 248.5 | 2 | |
| 237.5 | 1 | |
| 236 | 1 | |
| 235 | 1 | |
| 224 | 1 | |
| 218 | 1 |
trend_lag168
Real number (ℝ≥0)
HIGH CORRELATION (×4), MISSING
| Distinct | 3450 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 180 |
| Missing (%) | 4.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.73669145 |
| Minimum | 42.45833333 |
|---|---|
| Maximum | 159.4166667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | 42.45833333 |
|---|---|
| 5-th percentile | 51.09895833 |
| Q1 | 63.16666667 |
| median | 78.5625 |
| Q3 | 96.83854167 |
| 95-th percentile | 127.1458333 |
| Maximum | 159.4166667 |
| Range | 116.9583333 |
| Interquartile range (IQR) | 33.671875 |
Descriptive statistics
| Standard deviation | 23.03692575 |
|---|---|
| Coefficient of variation (CV) | 0.2818431397 |
| Kurtosis | -0.06197980381 |
| Mean | 81.73669145 |
| Median Absolute Deviation (MAD) | 16.71875 |
| Skewness | 0.6482450374 |
| Sum | 346236.625 |
| Variance | 530.6999479 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 61.22916667 | 6 | 0.1% |
| 82.0625 | 6 | 0.1% |
| 64.5 | 5 | 0.1% |
| 100.2708333 | 5 | 0.1% |
| 69.70833333 | 5 | 0.1% |
| 89.77083333 | 5 | 0.1% |
| 63.0625 | 4 | 0.1% |
| 68.29166667 | 4 | 0.1% |
| 68.95833333 | 4 | 0.1% |
| 68.45833333 | 4 | 0.1% |
| Other values (3440) | 4188 | |
| (Missing) | 180 | 4.1% |
| Value | Count | Frequency (%) |
| 42.45833333 | 2 | |
| 42.75 | 2 | |
| 42.8125 | 1 | |
| 42.83333333 | 1 | |
| 43.14583333 | 1 | |
| 43.16666667 | 1 | |
| 43.45833333 | 1 | |
| 43.64583333 | 1 | |
| 43.85416667 | 1 | |
| 44.0625 | 1 |
| Value | Count | Frequency (%) |
| 159.4166667 | 1 | |
| 158.8541667 | 1 | |
| 158.3958333 | 1 | |
| 157.6041667 | 1 | |
| 156.5833333 | 1 | |
| 156.4791667 | 1 | |
| 156.125 | 1 | |
| 156.0416667 | 1 | |
| 155.8333333 | 1 | |
| 155.7708333 | 1 |
seasonal_lag168
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 168 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.563980243 × 10⁻¹⁶ |
| Minimum | -59.18267114 |
|---|---|
| Maximum | 60.2481121 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1947 |
| Negative (%) | 44.1% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | -59.18267114 |
|---|---|
| 5-th percentile | -55.10446076 |
| Q1 | -11.74773262 |
| median | 3.261488692 |
| Q3 | 14.66896251 |
| 95-th percentile | 29.60410026 |
| Maximum | 60.2481121 |
| Range | 119.4307832 |
| Interquartile range (IQR) | 26.41669513 |
Descriptive statistics
| Standard deviation | 26.15670324 |
|---|---|
| Coefficient of variation (CV) | 3.054269452 × 10¹⁶ |
| Kurtosis | 0.5671612721 |
| Mean | 8.563980243 × 10⁻¹⁶ |
| Median Absolute Deviation (MAD) | 14.54912341 |
| Skewness | -0.3639205388 |
| Sum | 2.728484105 × 10⁻¹² |
| Variance | 684.1731244 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -2.133376973 | 177 | 4.0% |
| -16.72160557 | 177 | 4.0% |
| 13.59681428 | 177 | 4.0% |
| 10.67024325 | 177 | 4.0% |
| -2.696560033 | 177 | 4.0% |
| -7.124269505 | 177 | 4.0% |
| -15.52795803 | 177 | 4.0% |
| 7.920015559 | 177 | 4.0% |
| 29.60410026 | 177 | 4.0% |
| 3.731832498 | 177 | 4.0% |
| Other values (14) | 2478 |
| Value | Count | Frequency (%) |
| -59.18267114 | 177 | |
| -55.10446076 | 177 | |
| -41.56302178 | 177 | |
| -16.72160557 | 177 | |
| -15.52795803 | 177 | |
| -13.35241158 | 177 | |
| -11.21283963 | 177 | |
| -9.191664769 | 177 | |
| -7.124269505 | 177 | |
| -2.696560033 | 177 |
| Value | Count | Frequency (%) |
| 60.2481121 | 177 | |
| 29.60410026 | 177 | |
| 28.98274325 | 177 | |
| 25.00050281 | 177 | |
| 20.17707385 | 177 | |
| 17.88540718 | 177 | |
| 13.59681428 | 177 | |
| 10.67024325 | 177 | |
| 8.759268746 | 177 | |
| 7.920015559 | 177 |
resid_lag168
| Distinct | 4185 |
|---|---|
| Distinct (%) | 98.8% |
| Missing | 180 |
| Missing (%) | 4.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.01476252105 |
| Minimum | -71.49947822 |
|---|---|
| Maximum | 279.3714234 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2216 |
| Negative (%) | 50.2% |
| Memory size | 34.6 KiB |
Quantile statistics
| Minimum | -71.49947822 |
|---|---|
| 5-th percentile | -35.23351169 |
| Q1 | -14.54831702 |
| median | -1.376015103 |
| Q3 | 12.79765293 |
| 95-th percentile | 37.64481823 |
| Maximum | 279.3714234 |
| Range | 350.8709016 |
| Interquartile range (IQR) | 27.34596995 |
Descriptive statistics
| Standard deviation | 23.62257685 |
|---|---|
| Coefficient of variation (CV) | -1600.172272 |
| Kurtosis | 9.719783155 |
| Mean | -0.01476252105 |
| Median Absolute Deviation (MAD) | 13.59437614 |
| Skewness | 1.256619629 |
| Sum | -62.53403916 |
| Variance | 558.026137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11.1693325 | 3 | 0.1% |
| -8.375730495 | 2 | < 0.1% |
| 0.2541647693 | 2 | < 0.1% |
| 23.55209282 | 2 | < 0.1% |
| -5.728644885 | 2 | < 0.1% |
| -7.714414466 | 2 | < 0.1% |
| 13.39635512 | 2 | < 0.1% |
| 3.146355115 | 2 | < 0.1% |
| 9.979460762 | 2 | < 0.1% |
| -23.57598095 | 2 | < 0.1% |
| Other values (4175) | 4215 | |
| (Missing) | 180 | 4.1% |
| Value | Count | Frequency (%) |
| -71.49947822 | 1 | |
| -67.92024325 | 1 | |
| -67.83690991 | 1 | |
| -67.14940991 | 1 | |
| -63.41477876 | 1 | |
| -62.21008842 | 1 | |
| -61.45826692 | 1 | |
| -60.20120864 | 1 | |
| -59.0742733 | 1 | |
| -59.02440991 | 1 |
| Value | Count | Frequency (%) |
| 279.3714234 | 1 | |
| 269.5172568 | 1 | |
| 139.9490646 | 1 | |
| 123.0643879 | 1 | |
| 114.2604262 | 1 | |
| 113.7296884 | 1 | |
| 98.20475675 | 1 | |
| 97.90625949 | 1 | |
| 97.46875949 | 1 | |
| 91.04218845 | 1 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
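The calculation described above can be sketched in plain Python as Pearson's r applied to ranks. This is an illustrative implementation, not the one pandas-profiling uses; the helper names `pearson`, `average_ranks` and `spearman` are hypothetical, and ties are assumed to share the average of their rank positions.

```python
from math import sqrt

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def spearman(x, y):
    """Pearson correlation of the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]          # monotonic but nonlinear
print(round(spearman(x, y), 6))  # 1.0
print(pearson(x, y) < 1.0)       # True
```

The cubic relationship is perfectly monotonic, so ρ reaches 1 while Pearson's r stays below it.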
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
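The invariance property can be checked numerically. The sketch below is a minimal illustration with made-up data; the helper name `pearson` is hypothetical, and the scale factors are assumed positive (a negative factor would flip the sign of r).

```python
from math import sqrt

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Illustrative values, not taken from the dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_transformed = pearson([10 * a - 3 for a in x], [0.5 * b + 100 for b in y])
print(abs(r - r_transformed) < 1e-9)   # True

# Any increasing straight line gives r = 1, whatever its slope.
shallow = pearson(x, [0.01 * a for a in x])
steep = pearson(x, [50 * a for a in x])
print(round(shallow, 6), round(steep, 6))   # 1.0 1.0
```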
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
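The pair-counting definition above translates almost directly into code. The sketch below implements exactly that definition (often called tau-a) on toy data; the function name `kendall_tau` is hypothetical. Library implementations commonly apply a tie correction (tau-b), so their results can differ when ties are present.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, with no tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1      # both variables move in the same direction
        elif s < 0:
            discordant += 1      # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
print(kendall_tau(x, [10, 20, 30, 40]))   # 1.0
print(kendall_tau(x, [40, 30, 20, 10]))   # -1.0
print(kendall_tau(x, [10, 30, 20, 40]))   # 0.6666666666666666
```

In the last call one of the six pairs is discordant, giving (5 - 1) / 6.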
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient for a bivariate normal input distribution.
First rows
| | datetime | num_orders | month | dayofweek | hour | lag_168 | lag_336 | lag_504 | lag_672 | rolling_mean_336h | rolling_mean_168h | trend_lag168 | seasonal_lag168 | resid_lag168 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2018-03-01 00:00:00 | 124 | 3 | 3 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 2018-03-01 01:00:00 | 85 | 3 | 3 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 2018-03-01 02:00:00 | 71 | 3 | 3 | 2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 2018-03-01 03:00:00 | 66 | 3 | 3 | 3 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 2018-03-01 04:00:00 | 43 | 3 | 3 | 4 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 2018-03-01 05:00:00 | 6 | 3 | 3 | 5 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 2018-03-01 06:00:00 | 12 | 3 | 3 | 6 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 2018-03-01 07:00:00 | 15 | 3 | 3 | 7 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 2018-03-01 08:00:00 | 34 | 3 | 3 | 8 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 2018-03-01 09:00:00 | 69 | 3 | 3 | 9 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Last rows
| | datetime | num_orders | month | dayofweek | hour | lag_168 | lag_336 | lag_504 | lag_672 | rolling_mean_336h | rolling_mean_168h | trend_lag168 | seasonal_lag168 | resid_lag168 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4406 | 2018-08-31 14:00:00 | 133 | 8 | 4 | 14 | 88.0 | 126.0 | 92.0 | 75.0 | 122.0 | 116.0 | 143.875000 | -9.191665 | -46.683335 |
| 4407 | 2018-08-31 15:00:00 | 116 | 8 | 4 | 15 | 117.0 | 104.0 | 106.0 | 96.0 | 115.0 | 102.5 | 145.750000 | 3.731832 | -32.481832 |
| 4408 | 2018-08-31 16:00:00 | 197 | 8 | 4 | 16 | 188.0 | 204.0 | 174.0 | 140.0 | 154.0 | 152.5 | 149.645833 | 29.604100 | 8.750066 |
| 4409 | 2018-08-31 17:00:00 | 217 | 8 | 4 | 17 | 170.0 | 165.0 | 138.0 | 142.0 | 184.5 | 179.0 | 153.854167 | 7.920016 | 8.225818 |
| 4410 | 2018-08-31 18:00:00 | 207 | 8 | 4 | 18 | 137.0 | 139.0 | 131.0 | 91.0 | 152.0 | 153.5 | 156.041667 | -15.527958 | -3.513709 |
| 4411 | 2018-08-31 19:00:00 | 136 | 8 | 4 | 19 | 113.0 | 84.0 | 98.0 | 91.0 | 111.5 | 125.0 | 155.833333 | -7.124270 | -35.709064 |
| 4412 | 2018-08-31 20:00:00 | 154 | 8 | 4 | 20 | 179.0 | 126.0 | 114.0 | 87.0 | 105.0 | 146.0 | 155.770833 | -2.696560 | 25.925727 |
| 4413 | 2018-08-31 21:00:00 | 159 | 8 | 4 | 21 | 166.0 | 144.0 | 143.0 | 123.0 | 135.0 | 172.5 | 156.583333 | 10.670243 | -1.253577 |
| 4414 | 2018-08-31 22:00:00 | 223 | 8 | 4 | 22 | 242.0 | 167.0 | 188.0 | 170.0 | 155.5 | 204.0 | 155.479167 | 13.596814 | 72.924019 |
| 4415 | 2018-08-31 23:00:00 | 205 | 8 | 4 | 23 | 173.0 | 155.0 | 162.0 | 123.0 | 161.0 | 207.5 | 152.833333 | 25.000503 | -4.833836 |